DETECTION METHOD FOR C-RAF-1 GENES



Field of the Invention 

The present invention relates to (1) a method of
identifying an individual at an increased risk for
developing cancer, (2) a method for determining a
prognosis of patients afflicted with cancer, and (3) a
method for determining the proper course of treatment for
a patient afflicted with cancer.

Background Information 

Lung cancer claims more lives in the United
States than any other neoplasm (R.S. Finley, Am. Pharm.
NS29, 39 (1989)), and of the various forms, lung
adenocarcinomas have one of the worst prognoses (T.P.
Miller, Semin. Oncol. 17, 11 (1990)). The incidence of
adenocarcinoma of the lung (ACL) in the United States is
also quickly rising (I. Linnoila, Hematol. Oncol. North.
Am. 4, 1027 (1990); J.B. Sorensen, H.H. Hansen, Cancer
Surviv. 8, 671 (1989)). In order to gain insight into
this complex and deadly disease, a model system for its
study has been developed. For such a model to provide
clinically relevant data, several criteria must be met.
The tumors produced should be histologically equivalent to
their human counterparts, tumor induction must be reliable
and reproducible, and the numbers generated must be great
enough to provide statistical significance. To satisfy
these conditions a system has been created which uses two
inbred mouse strains (NFS/n and AKR) along with
transplacental exposure to the potent carcinogen
N-ethyl-N-nitrosourea (ENU) and promotion with the antioxidant
butylated hydroxytoluene (BHT). The resulting tumors were
examined for altered expression or structural mutations of
genes implicated in lung tumor development such as ras,
myc, and raf oncogenes (C.D. Little et al., Nature 306,
194 (1983); P.E. Kiefer et al., Cancer Res. 47, 6236
(1987); E. Santos et al., Science 223, 661 (1984); S.
Rodenhuis, N. Engl. J. Med. 317, 929 (1987); M. Barbacid,
Eur. J. Clin. Invest. 20, 225 (1990); U.R. Rapp et al.,
J. Int. Assoc. Study Lung Cancer 4, 162 (1988);
M.J. Birrer et al., Ann. Rev. Med. 40, 305 (1989); G.
Sithanandam et al., Oncogene 4, 451 (1989)).

raf proto-oncogenes are evolutionarily highly
conserved genes encoding cytoplasmic serine/threonine
specific kinases, which function in mitogen signal
transduction (reviewed in U.R. Rapp et al., The Oncogene
Handbook, T. Curran et al., Eds. (Elsevier Science
Publishers, The Netherlands, 1988), pp. 115-154; U.R.
Rapp, Oncogene 6, 495 (1991)). The three known active
members in the raf family encode phosphoproteins of
similar size (72/74 kD for Raf-1; 68 kD for A-Raf-1; and
74 kD for B-Raf) (U.R. Rapp et al., in Retroviruses and
Human Pathology, R. Gallo et al., Eds. (Humana Press,
Clifton, New Jersey 1985), pp. 449-472; T.W. Beck et al.,
Nucleic Acids Res. 15, 595 (1987); G. Sithanandam et al.,
Oncogene 5, 1775 (1990)). Raf-1 was first identified as
the cellular homologue of v-raf (H.W. Jansen et al.,
Nature 307, 218 (1984)), the transforming gene of 3611 MSV
(U.R. Rapp et al., J. Virol. 45, 914 (1983); U.R. Rapp et
al., Proc. Natl. Acad. Sci. USA 80, 4218 (1983)). Amino
acid comparisons of raf family genes show three conserved
regions [CR1, CR2, CR3] (T.W. Beck et al., Nucleic Acids
Res. 15, 595 (1987)); CR1 is a regulatory region
surrounding a Cys finger consensus sequence, CR2 is a
serine/threonine rich region, and CR3 represents the
kinase domain. Raf-1 has been mapped to chromosome 3p25
in humans (S.J. O'Brien et al., Science 223, 71 (1984)),
and this region has been found to be frequently altered in
small cell lung carcinoma (SCLC) (J. Whang-Peng et al.,
Cancer Genet. Cytogenet. 6, 119 (1982); J.M. Ibson et al.,
J. Cell. Biochem. 33, 267 (1987)), familial renal cell
carcinoma (A.J. Cohen et al., N. Engl. J. Med. 301, 592
(1979); G. Kovacs et al., Int. J. Cancer 40, 171 (1987)),
mixed parotid gland tumors (J. Mark et al., Hereditas 96,
141 (1982)), and ovarian cancer (K. Tanaka et al., Cancer
Genet. Cytogenet. 43, 1 (1989)).

Raf genes are differentially expressed in
various tissues (S.M. Storm et al., Oncogene 5, 345
(1990)). c-raf-1 has been found to be expressed
ubiquitously, though absolute levels vary between tissues.
A-raf-1 is present predominantly in the urogenital
tissues, whereas B-Raf is most abundant in cerebrum and
testis. The ubiquitous c-Raf-1 kinase is regulated by
tyrosine and serine phosphorylations that result from
activated growth factor receptor kinases (D.K. Morrison et
al., Cell 58, 648 (1989); D.K. Morrison et al., Proc.
Natl. Acad. Sci. USA 85, 8855 (1989); K.S. Kovacina et
al., J. Biol. Chem. 265, 12115 (1990); P.J. Blackshear et
al., J. Biol. Chem. 265, 12131 (1990); M.P. Carroll et
al., J. Biol. Chem. 265, 19812 (1990); J.N. Siegel et al.,
J. Biol. Chem. 265, 18472 (1990); B.C. Turner et al.,
Proc. Natl. Acad. Sci. USA 88, 1227 (1991); M. Baccarini
et al., EMBO J. 9, 3649 (1990); H. App et al., Mol. Cell.
Biol. 11, 913 (1991)). Raf-1 operates downstream of Ras
in mitogen signal transduction as judged by experiments
using antibody microinjection (M.R. Smith et al., Nature
320, 540 (1986)), c-raf-1 antisense expression constructs
(W. Kolch et al., Nature 349, 426 (1991)), dominant
negative mutants (W. Kolch et al., Nature 349, 426
(1991)), and Raf revertant cells. Studies with NIH3T3
cells and brain tissue demonstrated that mitogen treatment
induces Raf-1 kinase activity and causes a transitory
relocation of the active enzyme from the cytoplasm to the
nucleus and perinuclear area (Z. Olah et al., Exp. Brain
Res. (in press); U.R. Rapp et al., in Cold Spring Harbor
Symposia on Quantitative Biology, Vol. LIII (Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, NY
1988) pp. 173-184).

Raf-1 coupling has been examined in more than a
dozen receptor systems and all strong mitogens stimulated
Raf-1 kinase activity (U.R. Rapp, Oncogene 6, 495 (1991);
D.K. Morrison et al., Cell 58, 648 (1989); D.K. Morrison
et al., Proc. Natl. Acad. Sci. USA 85, 8855 (1989); K.S.
Kovacina et al., J. Biol. Chem. 265, 12115 (1990); P.J.
Blackshear et al., J. Biol. Chem. 265, 12131 (1990); M.P.
Carroll et al., J. Biol. Chem. 265, 19812 (1990); J.N.
Siegel et al., J. Biol. Chem. 265, 18472 (1990); B.C.
Turner et al., Proc. Natl. Acad. Sci. USA 88, 1227 (1991);
M. Baccarini et al., EMBO J. 9, 3649 (1990); H. App et
al., Mol. Cell. Biol. 11, 913 (1991)), and this
stimulation correlated with an increase in Raf-1
phosphorylation leading to a shift in apparent molecular
weight.

SUMMARY OF THE INVENTION 

It is an object of this invention to provide a 
method of identifying an individual at an increased risk 
for developing cancer. 

It is another object of this invention to 
provide a method for determining a prognosis in patients 
afflicted with cancer. 

It is a further object of this invention to
provide a method for determining the proper course of
treatment for a patient afflicted with cancer.

Further objects and advantages of the present
invention will be clear from the description that follows.

In one embodiment, the present invention relates
to a method of identifying an individual at an increased
risk for developing cancer, comprising:

    amplifying a region of the c-raf-1 gene;
    analyzing products of the amplification for evidence
    of mutation; and
    classifying an individual having one or more
    mutations in the region as having an increased risk for
    developing cancer.

In another embodiment, the present invention
relates to a method for determining a prognosis in
patients afflicted with cancer, comprising:

    amplifying a region of the c-raf-1 gene;
    analyzing products of the amplification for evidence
    of mutation; and
    classifying patients having no mutation in said
    region as being less likely to suffer disease relapse or
    having an increased chance of survival than those patients
    having one or more mutations in said region.

In a further embodiment, the present invention
relates to a method for determining the proper course of
treatment for a patient afflicted with cancer, comprising:

    amplifying a region of the c-raf-1 gene;
    analyzing products of said amplification for evidence
    of mutation;
    identifying a patient having at least one mutation in
    said region, which patient may require treatment proper
    for patients having a lesser chance of survival or
    decreased time to relapse; and
    identifying a patient lacking mutations in said
    region, which patient may require treatment proper for
    patients having a greater chance of survival or being less
    likely to suffer disease relapse.

BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1. Effect of BHT promotion on ENU
tumorigenesis in NFS/n x AKR mice. The X-axis represents
percent tumor induced mortality within each group, and the
Y-axis reflects age in weeks. All animals were exposed to
ENU transplacentally at a dose of 0.5 mM/Kg of mother's
body weight on day 16 of gestation (presence of vaginal
plug was scored as day one). At two weeks of age mice
were weaned into two separate groups and separated by sex.
Both groups were housed in identical cages and supplied
with food (Purina Lab Chow) and water ad libitum.
Beginning at three weeks of age, group 2A (0) was given
weekly intraperitoneal (i.p.) injections of corn oil (0.1
ml), and group 2B (o) received weekly i.p. injections of
BHT (20 mg/Kg of body weight) dissolved in corn oil.
Administration of BHT reduces the mean age of mortality
from approximately 20 weeks to 13, and decreases the
initial age of mortality. These curves are significantly
different (p ≤ 0.001) as judged by a 2-tailed Cox test. In
both groups the rate of tumorigenesis was identical for
males and females.

Figure 2. Northern blot analysis of proto- 
oncogene expression levels in ENU induced tumors. 

Figure 3. Diagnostic digestion of PCR amplified
Ki-ras genes from ENU induced tumors. Genomic DNA was
isolated from a cesium chloride gradient during RNA
preparations. In each case 10 ng was amplified via PCR
(95°C, 5 min. followed by 35 cycles of 95°C, 1 min. ->
55°C, 1 min. -> 72°C, 1 min.) with 2 units of Taq I
polymerase. The primers used (K1: 5'-AACTTGTGGTGGTTGGACCT-3'
(SEQ ID NO: 6) and K2: 3'-GTCTTAGTGAAACACCTACT-5'
(SEQ ID NO: 7)) generate a 79 b.p.
product. The primer K1 ends at codon 12 and contains a
mismatch from the normal mouse Ki-ras sequence at its 18th
nucleotide (G -> C), creating a BstNI site (CCTGG) in
conjunction with a normal codon 12 (GGT). Digestion of
amplified product from a normal allele with BstNI produces
two products of 19 and 60 b.p., whereas a mutation in one
of the first two positions of codon 12 will eliminate the
BstNI site. The presence of two normal alleles results in
all of the product being cleaved, and the presence of one
mutant and one normal allele will result in only half of
the product being cut. In the three panels each sample was run
in duplicate, uncut and cut with BstNI. F1 is DNA from an
untreated NFS/n X AKR F1 mouse, and MCA5 is a murine cell
line known to harbor a mutant Ki-ras codon 12 allele. One
lymphoma (24Ly) and one cell line (117; derived from a
lung adenocarcinoma) display a mutated Ki-ras codon 12
allele; however, 24Ly was a passaged tumor and examination
of the original tumor showed two normal alleles, indicating
that this mutation was acquired during passaging.
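
The logic of the diagnostic digest described for Figure 3 can be sketched in a few lines of Python. This is an illustration only, not part of the described protocol: the two primer sequences are taken from the text above, but the 36 bases between codon 12 and the K2 primer are not given here, so they are filled with the placeholder "N" (an assumption), and the function names are hypothetical.

```python
import re

# Primer K1 as given in the text (5'->3'); its 18th base carries the G->C mismatch.
K1 = "AACTTGTGGTGGTTGGACCT"
# Top-strand complement of primer K2 (K2 is written 3'->5' in the text).
K2_TOP = "CAGAATCACTTTGTGGATGA"
# Hypothetical filler: the bases between codon 12 and K2 are not listed in the text.
FILLER = "N" * 36

# BstNI recognizes CCWGG (W = A or T) and cuts after the second C.
BSTNI_SITE = re.compile(r"CC[AT]GG")


def amplicon(codon12: str) -> str:
    """Build a 79 bp model amplicon carrying the given codon 12."""
    return K1 + codon12 + FILLER + K2_TOP


def bstni_fragments(seq: str) -> list[int]:
    """Return the fragment lengths produced by cutting every BstNI site."""
    cuts = [m.start() + 2 for m in BSTNI_SITE.finditer(seq)]
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    print(bstni_fragments(amplicon("GGT")))  # normal codon 12: [19, 60]
    print(bstni_fragments(amplicon("GAT")))  # second-position mutant: [79]
```

A change in the third position of codon 12 (for example GGC) leaves the CCTGG site intact in this model, which is consistent with the statement that only mutations in the first two positions eliminate the site.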

Figure 4. c-raf-1 RNAse protection analysis of
ENU induced tumors. The probe used was a 32P labeled
antisense transcript from the 3' non-coding region of a
mouse c-raf-1 cDNA to the 3' most StuI site.
Hybridization of this probe with normal RNA results in a
protected fragment of 1.2 kb covering the region encoding
the Raf-1 kinase domain. One µg of poly(A)+ RNA from each
tumor and 5 µg of F1 RNA (in order to get comparable
signals) was hybridized for 12 hours at 52°C with 200,000
cpm of labeled mouse c-raf antisense transcript.
Hybrids were then digested for 30 minutes with 25 µg RNAse
A and 33 units of RNAse T1 at room temperature. Digested
hybrids were then incubated with 50 µg of proteinase K,
phenol/chloroform extracted, ethanol precipitated, and
resuspended in loading dye containing 80% formamide.
Samples were then run on 6% polyacrylamide denaturing
sequencing gels at 65 watts. Gels were vacuum dried at 80
degrees C and exposed to X-ray film. Probe is undigested
probe alone; tRNA is probe hybridized to non-specific RNA;
v-raf is probe hybridized to RNA from a v-raf transformed
cell line and the bands detected represent single base
mismatches between murine c-raf and v-raf; NFS/AKR F1 is
probe hybridized with RNA from a normal (untreated) mouse;
24Ly is probe hybridized with RNA from a lymphoma; and
the remaining lanes are probe hybridized with RNA isolated
from lung tumors.

Figure 5. Schematic of Raf-1 protein showing
sites of ENU induced mutations. CR1, CR2, and CR3
represent conserved regions 1, 2 and 3. cDNAs were made
from tumor derived poly(A)+ RNA using MoMuLV reverse
transcriptase. Primers (MR1 sequence and MR2 sequence)
encompassing a 435 base pair region of c-raf were then used
to amplify this region via PCR. The amplification mixture
was then run on 1.7% agarose gels and the 435 bp product
isolated. This isolated fragment was then treated with T4
polymerase and cloned into the HincII site of M13mp18 for
sequencing. Another set of primers (EMR1 sequence and
EMR2 sequence) was designed containing EcoRI sites at the
termini and used to amplify a 609 base pair region
(encompassing the original 435 base pair region).
Isolated products from these reactions were then digested
with EcoRI and cloned into the EcoRI site of KS.
Sequencing reactions were carried out using the Sequenase
kit (USB) according to the recommended protocols for
single and double stranded sequencing. Sequencing
reactions were run on 6% polyacrylamide denaturing gels at
65 watts. Gels were vacuum dried at 80 degrees C and
exposed to X-ray film. In each case a normal allele was
also sequenced along with the mutant allele.

Figure 6. Schematic for identifying c-raf-1
mutations. Primers 1 and 2 are shown in SEQ ID NO: 8 and
SEQ ID NO: 9, respectively.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to methods that
involve amplifying a region of the c-raf-1 gene (the
sequence of a mouse c-raf-1 gene is shown in SEQ ID NO: 10;
the nucleotide and corresponding amino acid sequence of a
human c-raf-1 gene is shown in SEQ ID NO: 11 and SEQ ID
NO: 12, respectively).

In one embodiment, the present invention relates
to a method of identifying an individual at an increased
risk for developing cancer (preferably, lung cancer,
T-cell lymphomas, renal cell carcinoma, ovarian carcinoma,
and mixed parotid gland tumors) comprising: amplifying a
region (preferably by using the polymerase chain reaction
(PCR) method or by cloning techniques) of the c-raf-1 gene
of the individual (SEQ ID NO: 11) (in one preferred
embodiment, the region encodes amino acids 514 to 535 of
SEQ ID NO: 12); analyzing products of the amplification for
evidence of mutation (preferably by DNA sequencing of the
region); and classifying an individual having one or more
mutations in the region as having an increased risk for
developing cancer. In one preferred embodiment, the region
encodes amino acids 500 to 550 of SEQ ID NO: 12 or amino
acids 450 to 630 of SEQ ID NO: 12. In another preferred
embodiment, the PCR method employs a primer comprising the
sequence shown in SEQ ID NO: 7 and a primer comprising the
sequence shown in SEQ ID NO: 8. In another preferred
embodiment, the method comprises the steps shown in Figure
6.
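
The amino acid ranges named in the preceding paragraph translate into nucleotide spans by simple codon arithmetic. The following Python sketch shows that conversion; it assumes coordinates counted from the first base of the start codon of the coding sequence, which need not coincide with the numbering of SEQ ID NO: 11 (that listing may include untranslated sequence). The function name is hypothetical.

```python
def codon_range_to_cds(first_aa: int, last_aa: int) -> tuple[int, int]:
    """Map an inclusive amino acid range to 1-based coding-sequence coordinates.

    Amino acid i is encoded by nucleotides 3*i - 2 through 3*i, counted from
    the first base of the start codon (an assumption about the numbering).
    """
    if first_aa < 1 or last_aa < first_aa:
        raise ValueError("expected 1 <= first_aa <= last_aa")
    return 3 * (first_aa - 1) + 1, 3 * last_aa


if __name__ == "__main__":
    for first, last in [(514, 535), (500, 550), (450, 630)]:
        start, end = codon_range_to_cds(first, last)
        print(f"aa {first}-{last}: nt {start}-{end} ({end - start + 1} nt)")
```

Under this numbering, amino acids 514 to 535 correspond to a 66-nucleotide span, which any amplified region would need to cover in full.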

In another embodiment, the present invention
relates to a method for determining a prognosis in a
patient afflicted with cancer (preferably, those cancers
listed above). The method comprises: amplifying the
region of the c-raf-1 gene as described above; analyzing
products of the amplification for evidence of mutation as
described above; and classifying a patient having no
mutation in the region as being less likely to suffer
disease relapse or having an increased chance of survival
than a patient having one or more mutations in the region.

In another embodiment, the present invention
relates to a method for determining the proper course of
treatment for a patient afflicted with cancer (preferably,
those cancers listed above), comprising: amplifying a
region (described above) of the c-raf-1 gene as described
above; analyzing products of the amplification for
evidence of mutation as described above; identifying a
patient having at least one mutation in the region, which
patient may require treatment proper for patients having a
lesser chance of survival or decreased time to relapse;
and identifying a patient lacking mutations in the region,
which patient may require treatment proper for patients
having a greater chance of survival or being less likely
to suffer disease relapse.
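
The analyze-and-classify steps shared by the three methods above can be illustrated with a minimal Python sketch. It assumes the amplified region has already been sequenced and aligned base-for-base with a normal reference of the same length; the function names, the label strings, and the short example sequences are all hypothetical and are not taken from the sequence listing.

```python
def point_mutations(reference: str, sample: str) -> list[tuple[int, str, str]]:
    """List (1-based position, reference base, sample base) for each mismatch."""
    if len(reference) != len(sample):
        raise ValueError("sequences must be aligned to the same length")
    return [
        (pos, ref_base, alt_base)
        for pos, (ref_base, alt_base) in enumerate(zip(reference, sample), start=1)
        if ref_base != alt_base
    ]


def classify(reference: str, sample: str) -> str:
    """Apply the classification step: any mutation in the region flags the sample."""
    if point_mutations(reference, sample):
        return "mutation detected"
    return "no mutation detected"


if __name__ == "__main__":
    normal = "GCTCCAGAGGTG"   # hypothetical reference fragment
    tumor = "GCTCCAGAGATG"    # hypothetical fragment with one substitution
    print(point_mutations(normal, tumor))
    print(classify(normal, tumor))
```

Insertions and deletions would need a real alignment step before this comparison; the sketch covers only the point-mutation case that the Examples report.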

Administration of therapeutic agents (cytotoxic 
or cytostatic) tailored to recognize the mutant Raf-1 
protein but not normal Raf-1 could specifically target 
tumor cells for death or growth inhibition. Such agents
could be comprised of cytotoxic T-cells, antibodies, 
and/or specifically designed chemical compounds. 

The following Examples demonstrate consistent
point mutations of the c-raf-1 proto-oncogene, within a
small region of the kinase domain, in a mouse model for
chemical tumor induction. This is the first demonstration
of point mutated raf genes in vivo, and the first
isolation of activating in vivo point mutations in the
kinase domain of a proto-oncogene. The tumors examined
show a selective specificity for Raf-1 mutations as
another family of genes, the ras proto-oncogenes which are
frequently activated by point mutation in both animal and
human tumors (S. Rodenhuis et al., Am. Rev. Respir. Dis.
142, S27-30; T.R. Devereux et al., Carcinogenesis 12, 299
(1991)), is not involved.

The present invention is described in further 
detail in the following non-limiting examples. 

EXAMPLES 

The following protocols and experimental details 
are referenced in the examples that follow: 

RNA Isolation. Tumors were excised, a small portion was
minced in PBS (phosphate buffered saline solution) for
passaging in nude mice, and the remainder was frozen
immediately in a dry ice/ethanol bath and stored at -70°C
until RNA extraction.
Frozen tissues were minced on wet ice in a guanidine
thiocyanate buffer (4 M guanidine thiocyanate, 10 mM EDTA, 2%
N-lauryl sarcosine, 2% beta-mercaptoethanol, 10 mM Tris
(pH=7.6)), disrupted in a Dounce homogenizer, and
extracted three times with phenol:chloroform:isoamyl
alcohol (24:24:2). Supernatants were then transferred to
SW41 tubes, 100 µg of cesium chloride per ml was added to
the supernatant which was then underlayed with one half
saturated cesium chloride in 10 mM EDTA (pH=7.0; index of
refraction 1.3995-1.4000), and centrifuged at 25,000 rpm
for 20 hours in a Sorvall SW-41TI rotor using a Beckman
model L5-50 ultracentrifuge. Supernatants were removed
and RNA pellets dissolved in 4 ml resuspension buffer (10
mM Tris-HCl pH=7.6, 5% beta-mercaptoethanol, 0.5% N-lauryl
sarcosine, 10 mM EDTA), extracted once with
phenol:chloroform:isoamyl alcohol, sodium acetate added to
0.12 M and RNA precipitated with two volumes ethanol at
-20°C overnight. Precipitates were centrifuged at 9,000 rpm
in a Sorvall SS-34 rotor for 30 minutes, and pellets
redissolved in RNA sample buffer (10 mM Tris pH=7.4, 1 mM
EDTA, 0.05% sodium dodecyl sulfate) and concentrations
determined by absorbance at 260 nm. Poly(A)+ RNA was
isolated by binding to oligo dT cellulose columns in high
salt (10 mM Tris pH=7.4, 1 mM EDTA, 0.05% SDS, 500 mM NaCl),
and eluting with RNA sample buffer heated to 40°C.
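
Determining RNA concentration from absorbance at 260 nm, as mentioned above, usually relies on the standard approximation that one A260 unit of single-stranded RNA in a 1 cm path corresponds to about 40 µg/ml. The Python sketch below applies that conversion; the dilution factor and the example readings are hypothetical, and the protocol above does not state which conversion factor was actually used.

```python
RNA_UG_PER_ML_PER_A260 = 40.0  # standard approximation for single-stranded RNA


def rna_concentration_ug_per_ml(a260: float, dilution_factor: float = 1.0,
                                path_length_cm: float = 1.0) -> float:
    """Estimate RNA concentration of the undiluted stock from an A260 reading."""
    if a260 < 0 or dilution_factor <= 0 or path_length_cm <= 0:
        raise ValueError("readings and factors must be positive")
    return a260 * RNA_UG_PER_ML_PER_A260 * dilution_factor / path_length_cm


def volume_ul_for_mass(mass_ug: float, concentration_ug_per_ml: float) -> float:
    """Volume of stock (in microliters) that contains the requested RNA mass."""
    return mass_ug * 1000.0 / concentration_ug_per_ml


if __name__ == "__main__":
    # Hypothetical reading: A260 = 0.25 measured on a 1:100 dilution.
    stock = rna_concentration_ug_per_ml(0.25, dilution_factor=100)
    print(stock, "ug/ml")
    print(volume_ul_for_mass(5.0, stock), "ul needed for a 5 ug lane")
```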

Northern Blotting. 5 µg poly(A)+ RNA per lane was ethanol
precipitated, desiccated, resuspended in loading buffer
(20 mM MOPS pH=6.8, 5 mM sodium acetate, 1 mM EDTA, 50%
formamide, 6% formaldehyde), heated at 65°C for 5 min.,
quick chilled on wet ice for 10 min., and electrophoresed
through a 0.7% agarose gel containing 2.2 M formaldehyde,
20 mM MOPS [pH=6.8], 5 mM sodium acetate, and 1 mM EDTA.
Gels were then blotted overnight onto nitrocellulose
filters via capillary transfer in 20X SSC; filters were
washed in 3X SSC for 10 min. and baked at 80°C for 2
hours.

Hybridizations. Filters were prehybridized at 42°C in 5X
SSC, 50% formamide, 20 mM sodium phosphate pH=6.8, 200
µg/ml PVP-40, 200 µg/ml ficoll 400, 200 µg/ml bovine serum
albumin, and 200 µg/ml sonicated sheared salmon sperm DNA.
Blots were then hybridized with 500,000 cpm/ml of random
primed 32P labeled probes overnight at 42°C in
prehybridization solution with 5% dextran sulfate. Blots
were washed with agitation in 2X SSC, 0.1% SDS at room
temperature six times for 20 minutes each wash, then
washed once at 45°C in 0.1X SSC for 15 minutes. Filters
were exposed to XAR-5 film at -70°C.

EXAMPLE 1 

Tumor Induction

NFS female mice were mated with AKR males and
pregnant females given a transplacental injection of
1-ethyl-1-nitrosourea (ENU) at a dosage of 0.5 mM/Kg
mother's body weight on day 16 of gestation, counting plug
date as day one. ENU was chosen for tumor induction since
it is a very potent direct acting carcinogen capable of
modifying any base in vivo (Singer, B. et al., 1983.
Molecular Biology of Mutagens and Carcinogens, Plenum
Press, New York). ENU alkylates all tissues with roughly
the same efficiency (E. Scherer et al., Cancer Lett. 46,
21 (1989)) and has a very short half life in vivo (E.M.
Faustman et al., Teratology 40, 199 (1989)), allowing
specific mutagenesis of tissues which are mitotically
active at a particular time. NFS and AKR were chosen as
parental strains based on earlier studies which showed
them to be particularly susceptible to lung tumors
following ENU exposure (B.A. Diwan et al., Cancer Res. 34,
764 (1974); S.L. Kauffman, JNCI 57, 821 (1976)). With
this procedure nearly 100% of the offspring develop lung
adenocarcinomas and approximately 70% develop, in
addition, T-cell lymphomas with a mean latency of
approximately 20 weeks. In order to achieve more rapid
tumor development, weanling mice were treated with weekly
injections of a tumor promoter, the antioxidant butylated
hydroxytoluene or BHT (20 mg/kg body weight dissolved in
corn oil). BHT was used as it has been demonstrated to
cause lung lesions and hyperplasia when injected into mice
(A.A. Marino et al., Proc. Soc. Exp. Biol. Med. 140, 122
(1972); H. Witschi et al., Proc. Soc. Exp. Biol. Med.
147, 690 (1974); N. Ito et al., CRC Crit. Rev. Toxicol.
15, 109 (1984)). In the present system it nearly doubles
the rate at which tumors develop. Figure 1 compares tumor
induced mortality with age of animals for those receiving
ENU alone, and those receiving ENU and promoted with BHT.
These curves demonstrate that when BHT is given the mean
age of tumor induced mortality decreases from
approximately 20 weeks to around 12, and there is also a
decrease in initial latency. These curves are
significantly different with a confidence limit greater
than 99.99% using a 2-tailed Cox test. In addition, BHT
promotion, while increasing the rate at which tumors
develop, does not affect the tumor spectrum.
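
The per-animal amounts implied by the dosages above follow from simple weight scaling, sketched below in Python. Two assumptions are made that the text does not state: "0.5 mM/Kg" is read as 0.5 mmol per kg of body weight, and the molecular weight of ENU (C3H7N3O2) is taken as about 117.11 g/mol. The body weights in the example are hypothetical.

```python
ENU_MOLECULAR_WEIGHT = 117.11  # g/mol for C3H7N3O2 (assumed value, not from the text)


def enu_dose_mg(body_weight_g: float, mmol_per_kg: float = 0.5) -> float:
    """ENU mass in mg, reading the stated dosage as mmol per kg body weight."""
    return mmol_per_kg * (body_weight_g / 1000.0) * ENU_MOLECULAR_WEIGHT


def bht_dose_mg(body_weight_g: float, mg_per_kg: float = 20.0) -> float:
    """BHT mass in mg for one weekly injection at the stated mg per kg dosage."""
    return mg_per_kg * body_weight_g / 1000.0


if __name__ == "__main__":
    # Hypothetical weights: a 30 g pregnant female and a 25 g treated offspring.
    print(round(enu_dose_mg(30.0), 3), "mg ENU")
    print(round(bht_dose_mg(25.0), 3), "mg BHT")
```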



EXAMPLE 2

Oncogene Expression

Northern blot analysis revealed elevated levels
of c-raf-1, as compared to normal tissue, in every tumor
examined (Figure 2), and Western blot analysis showed that
protein levels correlated with message levels (U.R. Rapp
et al., in Oncogenes and Cancer, S.A. Aaronson et al.,
Eds. (Tokyo/VNU Scientific Press, Tokyo, 1987) pp. 55-74).
In addition, in cell lines derived from primary tumors,
Raf-1 protein kinase activity was shown by immune-complex
kinase assays to be constitutive. Further analysis of
other oncogenes revealed no consistent pattern of
expression except for ras and myc family genes. In the
case of the myc family, one member (either c-, N-, or
L-myc) was overexpressed but never more than one. For the
ras genes, at least one member (Ki-, Ha-, or N-ras), and
often more than one, was expressed at high levels when
compared with the normal tissue. In addition, all
oncogenes examined via Northern analysis exhibited full
length, normal sized transcripts.

ras genes were considered likely candidates for
mutational activation since oncogenic forms of Ki-ras have
previously been observed in lung tumors (S. Rodenhuis et
al., Am. Rev. Respir. Dis. 142, S27-30; T.R. Devereux et
al., Carcinogenesis 12, 299 (1991)) and ENU is a point
mutagen (Singer, B. et al., 1983. Molecular Biology of
Mutagens and Carcinogens, Plenum Press, New York). A
systematic analysis of various ras codons known to be
involved in oncogenic activation was therefore performed.
Ha-, Ki-, and N-ras were examined at codons 12, 13, and
61 for potential mutations via RNAse protection assays
(R.M. Myers et al., Science 230, 1242 (1985); E. Winter et
al., Proc. Natl. Acad. Sci. USA 82, 7575 (1985)), PCR
amplification followed by subsequent sequencing (F. Sanger
et al., J. Mol. Biol. 13, 373 (1975)), and PCR
amplification followed by diagnostic restriction digests
(W. Jiang et al., Oncogene 4, 923 (1989)). PCR
amplification creating diagnostic enzyme sites is a very
efficient way of examining alleles for mutations at known
sites and involves designing a PCR primer whose 3' end
lies next to and produces a novel restriction site
encompassing the codon of interest. Following
amplification, PCR products from normal alleles will
contain the new restriction site, while mutant alleles
will not. Digestion of the product from tissue with two
normal alleles results in all product being cut; however,
if one allele contains a mutation, only half of the
product will be digested. Figure 3 shows the results of
amplification and diagnostic digestion applied to Ki-ras
codon 12 in several tumors and cell lines. The first
panel is from a set of lymphomas. F1 is DNA from a normal
untreated mouse and both alleles are cut by BstNI,
indicating the presence of two normal alleles. MCA5 is a
murine cell line known to contain a Ki-ras codon 12
mutation (L.F. Parada et al., Mol. Cell. Biol. 3, 2298
(1983)), and only the amplified normal allele is cleaved.
Of the five tumors shown in the second panel, one shows a
mutant Ki-ras allele. The next panel shows some of the
lung tumors tested and none of them exhibit a mutant
allele, and the final panel shows tumor derived cell
lines. The first three are from lymphomas and the last
three from lung adenocarcinomas. One lung tumor line
(#117) has a Ki-ras 12 mutation that was not present in
the primary tumor but came up upon transplantation. This
analysis has been performed with Ki-, Ha-, and N-ras genes at
codons 12 and 61. Of all the tumors and cell lines
examined by this method for mutations of the three ras
genes at codons 12 and 61, the two shown here were the
only ones detected. Examination of codon 13 was done by
PCR amplification of genomic DNA surrounding codon 13
followed by cloning into KS+ (Stratagene) and double
stranded sequencing. Table I summarizes the ras mutation
data. The most notable point from this table is the
conspicuous lack of ras mutations in these tumors. In
fact the number of ras mutations is much lower than would
be expected for a sampling of spontaneous tumors (S.
Rodenhuis et al., Am. Rev. Respir. Dis. 142, S27-30; T.R.
Devereux et al., Carcinogenesis 12, 299 (1991); J.L. Bos,
Cancer Res. 49, 4682 (1989)). Having eliminated ras genes
as playing a primary role in the genesis of these ENU
induced tumors, c-raf-1 was investigated for possible
small or point mutations.



TABLE I

Tumors and Cell Lines Positive for ras Mutations

              Codon 12              Codon 13              Codon 61
          Tumors  Cell Lines    Tumors  Cell Lines    Tumors  Cell Lines
Ha-ras     0/10      0/6         0/6       0/2         0/10      0/6
Ki-ras     1/10*     1/6         0/6       0/2         0/6       0/2
N-ras      0/10      0/6         0/6       0/2         0/10      0/6

* This was a second passage tumor in which the original
tumor did not contain a Ki-ras mutation.

Table I: Summary of mutation analysis for Ha-, Ki-, and
N-ras at codons 12, 13, and 61. Each box displays the
number of mutations detected, over the number of tumors
and tumor derived cell lines examined via RNAse
protection, sequencing or diagnostic digestion, for each
of the nine codons.
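
The counts in Table I can be tallied programmatically to make the overall rate explicit. The Python sketch below transcribes the table entries as (mutations, samples examined) pairs; note that the totals count gene-and-codon examinations rather than distinct tumors, since the same specimen may have been tested at several codons. The data structure and function name are illustrative only.

```python
# (mutations detected, samples examined), transcribed from Table I.
TABLE_I = {
    "Ha-ras": {"codon 12": {"tumors": (0, 10), "cell lines": (0, 6)},
               "codon 13": {"tumors": (0, 6), "cell lines": (0, 2)},
               "codon 61": {"tumors": (0, 10), "cell lines": (0, 6)}},
    "Ki-ras": {"codon 12": {"tumors": (1, 10), "cell lines": (1, 6)},
               "codon 13": {"tumors": (0, 6), "cell lines": (0, 2)},
               "codon 61": {"tumors": (0, 6), "cell lines": (0, 2)}},
    "N-ras": {"codon 12": {"tumors": (0, 10), "cell lines": (0, 6)},
              "codon 13": {"tumors": (0, 6), "cell lines": (0, 2)},
              "codon 61": {"tumors": (0, 10), "cell lines": (0, 6)}},
}


def tally(table: dict) -> tuple[int, int]:
    """Sum mutations and examinations over every gene, codon, and sample type."""
    mutations = examined = 0
    for codons in table.values():
        for sample_types in codons.values():
            for found, tested in sample_types.values():
                mutations += found
                examined += tested
    return mutations, examined


if __name__ == "__main__":
    mutations, examined = tally(TABLE_I)
    print(f"{mutations} mutations in {examined} codon examinations")
```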

KXAMPLE 3 

35 Mutatio ns in Raf-1 



Since no point mutations had been described
for raf genes in vivo, as had been for the ras genes (E.
Santos et al., Science 223, 661 (1984); S. Rodenhuis,
N. Engl. J. Med. 317, 929 (1987); M. Barbacid, Eur. J.
Clin. Invest. 20, 225 (1990); F. Sanger et al., J. Mol.
Biol. 13, 373 (1975)), point mutations were screened for
using RNAse protection assays (R.M. Myers et al.,
Science 230, 1242 (1985); E. Winter et al., Proc. Natl.
Acad. Sci. USA 82, 7575 (1985)). Figure 4 shows a
typical protection assay using a c-raf-1 probe. In this
experiment the probe covered the 3' end of raf-1
from the 3'-most StuI site to the end of the coding
sequence. The first lane is a marker (pBR322 digested
with HaeIII); the second shows the undigested probe
alone; the third shows the probe hybridized to
unrelated RNA, in this case tRNA; and the fourth shows
hybridization with RNA from v-raf transformed cells,
where the lower bands represent cleavage at points where
the mouse c-raf-1 gene differs from v-raf. The fifth lane
shows hybridization with RNA isolated from a normal lung
of an untreated F1 mouse, and the remaining lanes contain
RNA isolated from several tumors. With the normal RNA,
only one, fully protected, band is detected, while with
the tumor RNA two major bands are seen after digestion.
Twenty out of 20 tumors analyzed in this fashion showed
this extra band. These data demonstrate the following
major points: 1) there is a tumor-specific alteration in
c-raf-1 that results in a region of non-homology
recognizable by either RNAse A or T1; 2) the alterations
are confined to the same region of one allele, as two
bands of equal size are present in the tumor lanes; and
3) both alleles were expressed at comparable levels, as
both bands are of approximately equal intensity. In the
assay shown, 5 µg of poly(A)+ RNA from normal tissue was
hybridized, whereas 1 µg was used from the tumors. This
was necessary to obtain signals that could be compared on
the same gel, owing to the overexpression of c-raf-1 in
the tumors. By running these assays with various markers
it was possible to estimate the approximate site of the
alteration(s) to be in the vicinity of the exon 14/exon
15 junction. In order to define the precise genetic
alteration or alterations, PCR primers were designed
which would generate a 600 bp fragment encompassing this
region. cDNAs from tumor-derived RNA were then
amplified and cloned into KS+ (Stratagene) for
double-stranded sequencing. The sequencing results from
several tumors are shown in Figure 5. The top portion
of Figure 5 presents a cartoon of the mouse Raf-1
protein. There are three conserved regions, CR1, CR2 and
CR3, with CR3 representing the kinase domain. The probe
used in the RNAse protection assays covers the indicated
area, and the PCR primers amplified the bracketed
region. Sequencing through this area revealed a variety
of mutations just downstream of the APE site. These
mutants are shown in an expanded version at the bottom
of Figure 5 (see also SEQ ID NO:1 for the normal mouse
sequence and SEQ ID NO:2, 3, 4, and 5 for the mutant
sequences). These mutants were isolated from four
separate tumors, and in each case a normal allele (SEQ
ID NO:1) was also sequenced. Repeating the cDNA
synthesis, PCR amplification, cloning and sequencing
gives the same sequence, and normal tissue shows no
mutations, demonstrating that these alterations are not
artifactual. Sequence covering the amplified region has
been examined, and it is interesting that all of these
changes occur within a very small region of the raf
protein. In fact, the region where these mutations occur
overlaps an epitope shared by monoclonal antibodies
generated against raf (W. Kolch et al., Oncogene 5, 713
(1990)), and computer modeling of the protein shows this
to be a hydrophilic domain, the structure of which is
predicted to be altered by these mutations. This
indicates a biologically important region of the
molecule, and indeed the first of these mutations tested
in NIH3T3 cell assays, after cloning into a retroviral
expression vector (E1-neo; G. Heidecker et al., Mol.
Cell. Biol. 10, 2503 (1990)), was found to be weakly
transforming when driven by a Moloney LTR. The
transformation efficiency was comparable to EC2, a
previously characterized mutation of human c-raf-1 cDNA
(G. Heidecker et al., Mol. Cell. Biol. 10, 2503 (1990);
C. Wasylyk et al., Mol. Cell. Biol. 9, 2247 (1989)) and
~20-fold lower than the v-raf oncogene.
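
The clustering of these changes can be checked directly against the sequence listing. The following is a minimal sketch, not part of the original disclosure, that compares residues 513-536 of the normal mouse sequence (SEQ ID NO:1) with the same window of SEQ ID NO:2 through NO:5. The windows are transcribed from the sequence listing; the function name `substitutions` and the dictionary layout are illustrative assumptions.

```python
# Residues 513-536, transcribed from the sequence listing (three-letter codes).
WINDOW_START = 513

NORMAL = ("Ala Pro Glu Val Ile Arg Met Gln Asp Asp Asn Pro Phe Ser Phe Gln "
          "Ser Asp Val Tyr Ser Tyr Gly Ile").split()  # SEQ ID NO:1

MUTANTS = {
    "SEQ ID NO:2": ("Ala Pro Glu Val Val Arg Met Gln Asp Asp Asn Pro Phe Ser Phe Gln "
                    "Ser Asp Val Tyr Ser Tyr Gly Ile").split(),
    "SEQ ID NO:3": ("Ala Pro Glu Val Ile Arg Met Gln Asp Asn Asn Pro Phe Ser Phe Gln "
                    "Ser Asp Val Tyr Ser Tyr Gly Ile").split(),
    "SEQ ID NO:4": ("Ala Pro Glu Val Ile Arg Met Gln Asp Asp Asn Pro Phe Ser Ser Gln "
                    "Ser Asp Val Tyr Ser Tyr Gly Ile").split(),
    "SEQ ID NO:5": ("Ala Pro Glu Val Ile Arg Met Gln Asp Asp Asn Pro Phe Ser Phe Gln "
                    "Ser Thr Cys Thr Phe Tyr Gly Ile").split(),
}


def substitutions(normal, mutant, start=WINDOW_START):
    """Return (position, normal_residue, mutant_residue) for every mismatch."""
    if len(normal) != len(mutant):
        raise ValueError("windows must have the same length")
    return [(start + i, n, m)
            for i, (n, m) in enumerate(zip(normal, mutant)) if n != m]


# Hypothetical summary structure: one list of changes per mutant sequence.
RESULTS = {name: substitutions(NORMAL, seq) for name, seq in MUTANTS.items()}

if __name__ == "__main__":
    for name, changes in RESULTS.items():
        print(name, ", ".join(f"{n}{pos}{m}" for pos, n, m in changes))
```

Within this window the four mutant sequences differ from the normal sequence only between residues 517 and 533, which is consistent with the statement that all of the changes fall within a very small region of the protein.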

* * * * *

All publications mentioned hereinabove are
hereby incorporated in their entirety by reference.

While the foregoing invention has been
described in some detail for purposes of clarity and
understanding, it will be appreciated by one skilled in
the art from a reading of this disclosure that various
changes in form and detail can be made without departing
from the true scope of the invention and appended
claims.

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 648 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

Met Glu His Ile Gln Gly Ala Trp Lys Thr Ile Ser Asn Gly Phe Gly
1 5 10 15

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys Ile Ser Pro Thr Ile
20 25 30

Val Gln Gln Phe Gly Tyr Gln Arg Arg Ala Ser Asp Asp Gly Lys Leu
35 40 45

Thr Asp Ser Ser Lys Thr Ser Asn Thr Ile Arg Val Phe Leu Pro Asn
50 55 60

Lys Gln Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp
65 70 75 80

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gln Pro Glu Cys Cys
85 90 95

Ala Val Phe Arg Leu Leu Gln Glu His Lys Gly Lys Lys Ala Arg Leu
100 105 110

Asp Trp Asn Thr Asp Ala Ala Ser Leu Ile Gly Glu Glu Leu Gln Val
115 120 125

Asp Phe Leu Asp His Val Pro Ile Thr Thr His Asn Phe Ala Arg Lys
130 135 140

Thr Phe Leu Lys Leu Ala Phe Cys Asp Ile Cys Gln Lys Phe Leu Leu
145 150 155 160

Asn Gly Phe Arg Cys Gln Thr Cys Gly Tyr Lys Phe His Glu His Cys
165 170 175

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn Ile Arg Gln
180 185 190

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala
195 200 205

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met
210 215 220

Pro Ala Ser Ser Gln His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe
225 230 235 240

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gln Arg Gln Arg
245 250 255

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val
260 265 270

Asp Ser Arg Met Ile Glu Asp Ala Ile Arg Ser His Ser Glu Ser Ala
275 280 285

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly
290 295 300

Trp Ser Gln Pro Lys Thr Pro Val Pro Ala Gln Arg Glu Arg Ala Pro
305 310 315 320

Gly Ser Gly Thr Gln Gln Lys Asn Lys Ile Arg Pro Arg Gly Gln Arg
325 330 335

Asp Ser Ser Tyr Tyr Trp Glu Ile Glu Ala Ser Glu Val Met Leu Ser
340 345 350

Thr Arg Ile Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp
355 360 365

His Gly Asp Val Ala Val Lys Ile Leu Lys Val Val Asp Pro Thr Pro
370 375 380

Glu Gln Leu Gln Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr
385 390 395 400

Arg His Val Asn Ile Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn
405 410 415

Leu Ala Ile Val Thr Gln Trp Cys Glu Gly Ser Ser Leu Tyr Lys His
420 425 430

Leu His Val Gln Glu Thr Lys Phe Gln Met Phe Gln Leu Ile Asp Ile
435 440 445

Ala Arg Gln Thr Ala Gln Gly Met Asp Tyr Leu His Ala Lys Asn Ile
450 455 460

Ile His Arg Asp Met Lys Ser Asn Asn Ile Phe Leu His Glu Gly Leu
465 470 475 480

Thr Val Lys Ile Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp
485 490 495

Ser Gly Ser Gln Gln Val Glu Gln Pro Thr Gly Ser Val Leu Trp Met
500 505 510

Ala Pro Glu Val Ile Arg Met Gln Asp Asp Asn Pro Phe Ser Phe Gln
515 520 525

Ser Asp Val Tyr Ser Tyr Gly Ile Val Leu Tyr Glu Leu Met Ala Gly
530 535 540

Glu Leu Pro Tyr Ala His Ile Asn Asn Arg Asp Gln Ile Ile Phe Met
545 550 555 560

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn
565 570 575

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val
580 585 590

Lys Glu Glu Arg Pro Leu Phe Pro Gln Ile Leu Ser Ser Ile Glu Leu
595 600 605

Leu Gln His Ser Leu Pro Lys Ile Asn Arg Ser Ala Ser Glu Pro Ser
610 615 620

Leu His Arg Ala Ala His Thr Glu Asp Ile Asn Ala Cys Thr Leu Thr
625 630 635 640

Thr Ser Pro Arg Leu Pro Val Phe
645



(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 648 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Glu His Ile Gln Gly Ala Trp Lys Thr Ile Ser Asn Gly Phe Gly
1 5 10 15

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys Ile Ser Pro Thr Ile
20 25 30

Val Gln Gln Phe Gly Tyr Gln Arg Arg Ala Ser Asp Asp Gly Lys Leu
35 40 45

Thr Asp Ser Ser Lys Thr Ser Asn Thr Ile Arg Val Phe Leu Pro Asn
50 55 60

Lys Gln Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp
65 70 75 80

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gln Pro Glu Cys Cys
85 90 95

Ala Val Phe Arg Leu Leu Gln Glu His Lys Gly Lys Lys Ala Arg Leu
100 105 110

Asp Trp Asn Thr Asp Ala Ala Ser Leu Ile Gly Glu Glu Leu Gln Val
115 120 125

Asp Phe Leu Asp His Val Pro Ile Thr Thr His Asn Phe Ala Arg Lys
130 135 140

Thr Phe Leu Lys Leu Ala Phe Cys Asp Ile Cys Gln Lys Phe Leu Leu
145 150 155 160

Asn Gly Phe Arg Cys Gln Thr Cys Gly Tyr Lys Phe His Glu His Cys
165 170 175

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn Ile Arg Gln
180 185 190

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala
195 200 205

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met
210 215 220

Pro Ala Ser Ser Gln His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe
225 230 235 240

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gln Arg Gln Arg
245 250 255

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val
260 265 270

Asp Ser Arg Met Ile Glu Asp Ala Ile Arg Ser His Ser Glu Ser Ala
275 280 285

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly
290 295 300

Trp Ser Gln Pro Lys Thr Pro Val Pro Ala Gln Arg Glu Arg Ala Pro
305 310 315 320

Gly Ser Gly Thr Gln Gln Lys Asn Lys Ile Arg Pro Arg Gly Gln Arg
325 330 335

Asp Ser Ser Tyr Tyr Trp Glu Ile Glu Ala Ser Glu Val Met Leu Ser
340 345 350

Thr Arg Ile Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp
355 360 365

His Gly Asp Val Ala Val Lys Ile Leu Lys Val Val Asp Pro Thr Pro
370 375 380

Glu Gln Leu Gln Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr
385 390 395 400

Arg His Val Asn Ile Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn
405 410 415

Leu Ala Ile Val Thr Gln Trp Cys Glu Gly Ser Ser Leu Tyr Lys His
420 425 430

Leu His Val Gln Glu Thr Lys Phe Gln Met Phe Gln Leu Ile Asp Ile
435 440 445

Ala Arg Gln Thr Ala Gln Gly Met Asp Tyr Leu His Ala Lys Asn Ile
450 455 460

Ile His Arg Asp Met Lys Ser Asn Asn Ile Phe Leu His Glu Gly Leu
465 470 475 480

Thr Val Lys Ile Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp
485 490 495

Ser Gly Ser Gln Gln Val Glu Gln Pro Thr Gly Ser Val Leu Trp Met
500 505 510

Ala Pro Glu Val Val Arg Met Gln Asp Asp Asn Pro Phe Ser Phe Gln
515 520 525

Ser Asp Val Tyr Ser Tyr Gly Ile Val Leu Tyr Glu Leu Met Ala Gly
530 535 540

Glu Leu Pro Tyr Ala His Ile Asn Asn Arg Asp Gln Ile Ile Phe Met
545 550 555 560

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn
565 570 575

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val
580 585 590

Lys Glu Glu Arg Pro Leu Phe Pro Gln Ile Leu Ser Ser Ile Glu Leu
595 600 605

Leu Gln His Ser Leu Pro Lys Ile Asn Arg Ser Ala Ser Glu Pro Ser
610 615 620

Leu His Arg Ala Ala His Thr Glu Asp Ile Asn Ala Cys Thr Leu Thr
625 630 635 640

Thr Ser Pro Arg Leu Pro Val Phe
645

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 648 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Met Glu His Ile Gln Gly Ala Trp Lys Thr Ile Ser Asn Gly Phe Gly
1 5 10 15

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys Ile Ser Pro Thr Ile
20 25 30

Val Gln Gln Phe Gly Tyr Gln Arg Arg Ala Ser Asp Asp Gly Lys Leu
35 40 45

Thr Asp Ser Ser Lys Thr Ser Asn Thr Ile Arg Val Phe Leu Pro Asn
50 55 60

Lys Gln Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp
65 70 75 80

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gln Pro Glu Cys Cys
85 90 95

Ala Val Phe Arg Leu Leu Gln Glu His Lys Gly Lys Lys Ala Arg Leu
100 105 110

Asp Trp Asn Thr Asp Ala Ala Ser Leu Ile Gly Glu Glu Leu Gln Val
115 120 125

Asp Phe Leu Asp His Val Pro Ile Thr Thr His Asn Phe Ala Arg Lys
130 135 140

Thr Phe Leu Lys Leu Ala Phe Cys Asp Ile Cys Gln Lys Phe Leu Leu
145 150 155 160

Asn Gly Phe Arg Cys Gln Thr Cys Gly Tyr Lys Phe His Glu His Cys
165 170 175

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn Ile Arg Gln
180 185 190

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala
195 200 205

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met
210 215 220

Pro Ala Ser Ser Gln His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe
225 230 235 240

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gln Arg Gln Arg
245 250 255

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val
260 265 270

Asp Ser Arg Met Ile Glu Asp Ala Ile Arg Ser His Ser Glu Ser Ala
275 280 285

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly
290 295 300

Trp Ser Gln Pro Lys Thr Pro Val Pro Ala Gln Arg Glu Arg Ala Pro
305 310 315 320

Gly Ser Gly Thr Gln Gln Lys Asn Lys Ile Arg Pro Arg Gly Gln Arg
325 330 335

Asp Ser Ser Tyr Tyr Trp Glu Ile Glu Ala Ser Glu Val Met Leu Ser
340 345 350

Thr Arg Ile Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp
355 360 365

His Gly Asp Val Ala Val Lys Ile Leu Lys Val Val Asp Pro Thr Pro
370 375 380

Glu Gln Leu Gln Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr
385 390 395 400

Arg His Val Asn Ile Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn
405 410 415

Leu Ala Ile Val Thr Gln Trp Cys Glu Gly Ser Ser Leu Tyr Lys His
420 425 430

Leu His Val Gln Glu Thr Lys Phe Gln Met Phe Gln Leu Ile Asp Ile
435 440 445

Ala Arg Gln Thr Ala Gln Gly Met Asp Tyr Leu His Ala Lys Asn Ile
450 455 460

Ile His Arg Asp Met Lys Ser Asn Asn Ile Phe Leu His Glu Gly Leu
465 470 475 480

Thr Val Lys Ile Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp
485 490 495

Ser Gly Ser Gln Gln Val Glu Gln Pro Thr Gly Ser Val Leu Trp Met
500 505 510

Ala Pro Glu Val Ile Arg Met Gln Asp Asn Asn Pro Phe Ser Phe Gln
515 520 525

Ser Asp Val Tyr Ser Tyr Gly Ile Val Leu Tyr Glu Leu Met Ala Gly
530 535 540

Glu Leu Pro Tyr Ala His Ile Asn Asn Arg Asp Gln Ile Ile Phe Met
545 550 555 560

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn
565 570 575

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val
580 585 590

Lys Glu Glu Arg Pro Leu Phe Pro Gln Ile Leu Ser Ser Ile Glu Leu
595 600 605

Leu Gln His Ser Leu Pro Lys Ile Asn Arg Ser Ala Ser Glu Pro Ser
610 615 620

Leu His Arg Ala Ala His Thr Glu Asp Ile Asn Ala Cys Thr Leu Thr
625 630 635 640

Thr Ser Pro Arg Leu Pro Val Phe
645



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 648 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Met Glu His Ile Gln Gly Ala Trp Lys Thr Ile Ser Asn Gly Phe Gly
1 5 10 15

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys Ile Ser Pro Thr Ile
20 25 30

Val Gln Gln Phe Gly Tyr Gln Arg Arg Ala Ser Asp Asp Gly Lys Leu
35 40 45

Thr Asp Ser Ser Lys Thr Ser Asn Thr Ile Arg Val Phe Leu Pro Asn
50 55 60

Lys Gln Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp
65 70 75 80

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gln Pro Glu Cys Cys
85 90 95

Ala Val Phe Arg Leu Leu Gln Glu His Lys Gly Lys Lys Ala Arg Leu
100 105 110

Asp Trp Asn Thr Asp Ala Ala Ser Leu Ile Gly Glu Glu Leu Gln Val
115 120 125

Asp Phe Leu Asp His Val Pro Ile Thr Thr His Asn Phe Ala Arg Lys
130 135 140

Thr Phe Leu Lys Leu Ala Phe Cys Asp Ile Cys Gln Lys Phe Leu Leu
145 150 155 160

Asn Gly Phe Arg Cys Gln Thr Cys Gly Tyr Lys Phe His Glu His Cys
165 170 175

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn Ile Arg Gln
180 185 190

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala
195 200 205

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met
210 215 220

Pro Ala Ser Ser Gln His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe
225 230 235 240

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gln Arg Gln Arg
245 250 255

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val
260 265 270

Asp Ser Arg Met Ile Glu Asp Ala Ile Arg Ser His Ser Glu Ser Ala
275 280 285

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly
290 295 300

Trp Ser Gln Pro Lys Thr Pro Val Pro Ala Gln Arg Glu Arg Ala Pro
305 310 315 320

Gly Ser Gly Thr Gln Gln Lys Asn Lys Ile Arg Pro Arg Gly Gln Arg
325 330 335

Asp Ser Ser Tyr Tyr Trp Glu Ile Glu Ala Ser Glu Val Met Leu Ser
340 345 350

Thr Arg Ile Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp
355 360 365

His Gly Asp Val Ala Val Lys Ile Leu Lys Val Val Asp Pro Thr Pro
370 375 380

Glu Gln Leu Gln Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr
385 390 395 400

Arg His Val Asn Ile Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn
405 410 415

Leu Ala Ile Val Thr Gln Trp Cys Glu Gly Ser Ser Leu Tyr Lys His
420 425 430

Leu His Val Gln Glu Thr Lys Phe Gln Met Phe Gln Leu Ile Asp Ile
435 440 445

Ala Arg Gln Thr Ala Gln Gly Met Asp Tyr Leu His Ala Lys Asn Ile
450 455 460

Ile His Arg Asp Met Lys Ser Asn Asn Ile Phe Leu His Glu Gly Leu
465 470 475 480

Thr Val Lys Ile Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp
485 490 495

Ser Gly Ser Gln Gln Val Glu Gln Pro Thr Gly Ser Val Leu Trp Met
500 505 510

Ala Pro Glu Val Ile Arg Met Gln Asp Asp Asn Pro Phe Ser Ser Gln
515 520 525

Ser Asp Val Tyr Ser Tyr Gly Ile Val Leu Tyr Glu Leu Met Ala Gly
530 535 540

Glu Leu Pro Tyr Ala His Ile Asn Asn Arg Asp Gln Ile Ile Phe Met
545 550 555 560

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn
565 570 575

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val
580 585 590

Lys Glu Glu Arg Pro Leu Phe Pro Gln Ile Leu Ser Ser Ile Glu Leu
595 600 605

Leu Gln His Ser Leu Pro Lys Ile Asn Arg Ser Ala Ser Glu Pro Ser
610 615 620

Leu His Arg Ala Ala His Thr Glu Asp Ile Asn Ala Cys Thr Leu Thr
625 630 635 640

Thr Ser Pro Arg Leu Pro Val Phe
645

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 648 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

Met Glu His Ile Gln Gly Ala Trp Lys Thr Ile Ser Asn Gly Phe Gly
1 5 10 15

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys Ile Ser Pro Thr Ile
20 25 30

Val Gln Gln Phe Gly Tyr Gln Arg Arg Ala Ser Asp Asp Gly Lys Leu
35 40 45

Thr Asp Ser Ser Lys Thr Ser Asn Thr Ile Arg Val Phe Leu Pro Asn
50 55 60

Lys Gln Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp
65 70 75 80

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gln Pro Glu Cys Cys
85 90 95

Ala Val Phe Arg Leu Leu Gln Glu His Lys Gly Lys Lys Ala Arg Leu
100 105 110

Asp Trp Asn Thr Asp Ala Ala Ser Leu Ile Gly Glu Glu Leu Gln Val
115 120 125

Asp Phe Leu Asp His Val Pro Ile Thr Thr His Asn Phe Ala Arg Lys
130 135 140

Thr Phe Leu Lys Leu Ala Phe Cys Asp Ile Cys Gln Lys Phe Leu Leu
145 150 155 160

Asn Gly Phe Arg Cys Gln Thr Cys Gly Tyr Lys Phe His Glu His Cys
165 170 175

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn Ile Arg Gln
180 185 190

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala
195 200 205

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met
210 215 220

Pro Ala Ser Ser Gln His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe
225 230 235 240

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gln Arg Gln Arg
245 250 255

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val
260 265 270

Asp Ser Arg Met Ile Glu Asp Ala Ile Arg Ser His Ser Glu Ser Ala
275 280 285

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly
290 295 300

Trp Ser Gln Pro Lys Thr Pro Val Pro Ala Gln Arg Glu Arg Ala Pro
305 310 315 320

Gly Ser Gly Thr Gln Gln Lys Asn Lys Ile Arg Pro Arg Gly Gln Arg
325 330 335

Asp Ser Ser Tyr Tyr Trp Glu Ile Glu Ala Ser Glu Val Met Leu Ser
340 345 350

Thr Arg Ile Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp
355 360 365

His Gly Asp Val Ala Val Lys Ile Leu Lys Val Val Asp Pro Thr Pro
370 375 380

Glu Gln Leu Gln Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr
385 390 395 400

Arg His Val Asn Ile Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn
405 410 415

Leu Ala Ile Val Thr Gln Trp Cys Glu Gly Ser Ser Leu Tyr Lys His
420 425 430

Leu His Val Gln Glu Thr Lys Phe Gln Met Phe Gln Leu Ile Asp Ile
435 440 445

Ala Arg Gln Thr Ala Gln Gly Met Asp Tyr Leu His Ala Lys Asn Ile
450 455 460

Ile His Arg Asp Met Lys Ser Asn Asn Ile Phe Leu His Glu Gly Leu
465 470 475 480

Thr Val Lys Ile Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp
485 490 495

Ser Gly Ser Gln Gln Val Glu Gln Pro Thr Gly Ser Val Leu Trp Met
500 505 510

Ala Pro Glu Val Ile Arg Met Gln Asp Asp Asn Pro Phe Ser Phe Gln
515 520 525

Ser Thr Cys Thr Phe Tyr Gly Ile Val Leu Tyr Glu Leu Met Ala Gly
530 535 540

Glu Leu Pro Tyr Ala His Ile Asn Asn Arg Asp Gln Ile Ile Phe Met
545 550 555 560

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn
565 570 575

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val
580 585 590

Lys Glu Glu Arg Pro Leu Phe Pro Gln Ile Leu Ser Ser Ile Glu Leu
595 600 605

Leu Gln His Ser Leu Pro Lys Ile Asn Arg Ser Ala Ser Glu Pro Ser
610 615 620

Leu His Arg Ala Ala His Thr Glu Asp Ile Asn Ala Cys Thr Leu Thr
625 630 635 640

Thr Ser Pro Arg Leu Pro Val Phe
645

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
AACTTGTGGT GGTTGGACCT 20

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
TCATCCACAA AGTGATTCTG 20

(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
AGGAGACCAA GTTTCAGATG 20

(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:
GCGTGCAAGC ATTGATATCC 20
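
As a quick sanity check on the four 20-base oligonucleotides listed as SEQ ID NO:6 to NO:9, the sketch below computes length, GC fraction, reverse complement, and a rough melting temperature. The Wallace rule (2 degrees C per A/T, 4 degrees C per G/C) is a generic rule of thumb brought in here purely for illustration; the source does not state which method, if any, was used to design these oligonucleotides, and the helper names are hypothetical.

```python
# Oligonucleotides transcribed from SEQ ID NO:6 to NO:9 (block spacing removed).
PRIMERS = {
    "SEQ ID NO:6": "AACTTGTGGTGGTTGGACCT",
    "SEQ ID NO:7": "TCATCCACAAAGTGATTCTG",
    "SEQ ID NO:8": "AGGAGACCAAGTTTCAGATG",
    "SEQ ID NO:9": "GCGTGCAAGCATTGATATCC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Rule-of-thumb melting temperature: 2*(A+T) + 4*(G+C), in degrees C."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), f"GC={gc_fraction(seq):.2f}",
              f"Tm~{wallace_tm(seq)}", reverse_complement(seq))
```

One observation that can be verified against the listing: SEQ ID NO:8 and the reverse complement of SEQ ID NO:9 (GGATATCAATGCTTGCACGC) both appear verbatim in the codon rows of SEQ ID NO:11, at rows ending 1344 and 1920 respectively.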

(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1947 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

ATGGAGCACA TACAGGGAGC TTGGAAGACG [atpappaatg] GCTTTGGACT CAAAGATGCG 60

GTGTTTGATG GCTCCAGCTG CATCTCCCCT [A PP ATTETTP] AGCAGTTTGG CTATCAGCGC 120

CGGGCCTCAG ATGATGGCAA GCTCACGGAT [TPTTPTAAGA] CAAGCAATAC TATCCGGGTT 180

TTCTTGCCGA ATAAGCAAAG GACTGTGGTC [AATGTGPGGA] ATGGAATGAG CTTACATGAC 240

TGCCTTATGA AAGCTCTGAA GGTGAGAGGC [PTGPAGPPAG] AGTGCTGTGC AGTGTTCAGA 300

CTTCTCCAGG AACACAAAGG TAAGAAAGCA [PP. PTTAG ATT] GGAACACCGA TGCCGCCTCT 360

CTGATTGGAG AAGAACTGCA [AGTGGATTTI] [TTGGATPATG] TTCCCATCAC AACTCACAAC 420

TTTGCTCGGA AAACGTTCCT GAAGCTTGCA [TTrTCTfi A PA] TCTGTCAGAA GTTCCTGCTA 480

AATGGATTTC GATGTCAGAC TTGTGGCTAC [A AP I I 1 PATG] AGCACTGTAG CACCAAAGTA 540

CCTACTATGT GTGTGGACTG GAGTAATATC [APAPAfSPTPT] TGCTGTTTCC AAATTCCACT 600

GTTGGTGACA GTGGAGTCCC AGCACCACCT [TPTTTPPrAA] TGCGTCGGAT GCGAGAATCT 660

GTTTCCCGGA TGCCTGCTAG TTCCCAGCAC [A P ATA PTPTA] CACCCCATGC CTTCACTTTC 720

AACACCTCCA GCCCTTCCTC AGAAGGTTCC [PTPTPPPAP.A] GGCAGAGGTC AACGTCCACT 780

CCCAATGTCC ACATGGTCAG CACCACCCTG [r ATPTCfZAPA] GCAGGATGAT TGAGGATGCA 840

ATTCGAAGTC ACAGTGAATC AGCCTCACCT [TPAPPPPTGT] CCAGCAGCCC AAACAACCTG 900

GGTCCAACAG GCTGGTCACA GCCCAAAACC [PPPP.TGPPAG] CACAAAGAGA GCGGGCACCA 960

GGATCTGGGA CCCAGCAAAA AAACAAAATT [APP.PPTPGTG] GGCAGAGAGA CTCGAGTTAT 1020

TACTGGGAAA TAGAAGCCAG TGAGGTGATG [PTPTPTAPTP] GGATCGGGTC AGGTTCCTTT 1080

GGCACTGTGT ACAAGGGCAA GTGGCATGGA GATGTTGCAG TAAAGATCCT AAAGGTGGTT 1140

GACCCAACTC CAGAGCAACT TCAGGCCTTC AGGAACGAGG TGGCTGTTTT GCGCAAAACA 1200

CGGCATGTTA ACATCCTGCT GTTCATGGGG TACATGACAA AGGACAACCT GGCGATTGTG 1260

ACTCAGTGGT GTGAAGGCAG CAGTCTCTAC AAACACCTGC ATGTCCAGGA GACCAAATTC 1320

CAGATGTTCC AGCTAATTGA CATTGCCCGA CAGACAGCTC AGGGAATGGA CTATTTGCAT 1380

GCAAAGAACA TCATCCACAG AGACATGAAA TCCAACAATA TATTTCTCCA TGAAGGCCTC 1440

ACGGTGAAAA TTGGAGATTT TGGTTTGGCA ACAGTGAAGT CACGCTGGAG TGGTTCTCAG 1500

CAGGTTGAAC AGCCCACTGG CTCTGTGCTG TGGATGGCCC CAGAAGTAAT CCGGATGCAG 1560

GATGACAACC CGTTCAGCTT CCAGTCCGAC GTGTACTCGT ACGGCATCGT GCTGTACGAG 1620

CTGATGGCTG GGGAGCTTCC CTACGCCCAC ATCAACAACC GAGACCAGAT CATCTTCATG 1680

GTAGGCCGTG GGTATGCATC CCCTGATCTC AGCAGGCTCT ACAAGAACTG CCCCAAGGCA 1740

ATGAAGAGGT TGGTGGCTGA CTGTGTGAAG AAAGTCAAAG AAGAGAGACC TTTGTTTCCC 1800

CAGATCCTGT CTTCCATCGA GCTGCTTCAG CACTCTCTGC CGAAAATCAA CAGGAGCGCC 1860

TCTGAGCCTT CCCTGCATCG GGCAGCTCAC ACTGAGGACA TCAATGCTTG CACGCTGACT 1920

ACATCCCCAA GGCTACCAGT CTTCTAG 1947
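
Because SEQ ID NO:10 is printed in ten-base blocks rather than codons, it can be hard to see where a given residue sits. The sketch below translates the row ending at nucleotide 1620 and maps residue numbers to nucleotide coordinates. It assumes that the coding sequence starts at nucleotide 1 and that the standard genetic code applies; the function names `codon_span` and `translate` are illustrative and not taken from the source.

```python
from itertools import product

# Standard genetic code, built in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Row of SEQ ID NO:10 whose running count is 1620, with block spacing removed.
ROW_END = 1620
ROW = ("GATGACAACC CGTTCAGCTT CCAGTCCGAC "
       "GTGTACTCGT ACGGCATCGT GCTGTACGAG").replace(" ", "")


def codon_span(residue: int) -> tuple:
    """1-based (first, last) nucleotide of a residue's codon, CDS starting at nt 1."""
    return 3 * residue - 2, 3 * residue


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


ROW_START = ROW_END - len(ROW) + 1      # first nucleotide of the row
FIRST_RESIDUE = (ROW_START + 2) // 3    # residue encoded by the row's first codon
PEPTIDE = translate(ROW)

if __name__ == "__main__":
    print(ROW_START, FIRST_RESIDUE, PEPTIDE)
    print(codon_span(522))
```

The translated row corresponds to residues 521-540 of SEQ ID NO:1 (Asp Asp Asn Pro Phe Ser Phe Gln Ser Asp Val Tyr Ser Tyr Gly Ile Val Leu Tyr Glu), which is the stretch in which SEQ ID NO:3, 4 and 5 differ from the normal sequence.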

(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1947 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1944

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

ATG GAG CAC ATA CAG GGA GCT TGG AAG ACG ATC AGC AAT GGT TTT GGA 48

Met Glu His Ile Gln Gly Ala Trp Lys Thr Ile Ser Asn Gly Phe Gly
1 5 10 15

TTC AAA GAT GCC GTG TTT GAT GGC TCC AGC TGC ATC TCT CCT ACA ATA 96

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys Ile Ser Pro Thr Ile
20 25 30

GTT CAG CAG TTT GGC TAT CAG CGC CGG GCA TCA GAT GAT GGC AAA CTC 144

Val Gln Gln Phe Gly Tyr Gln Arg Arg Ala Ser Asp Asp Gly Lys Leu
35 40 45

ACA GAT CCT TCT AAG ACA AGC AAC ACT ATC CGT GTT TTC TTG CCG AAC 192

Thr Asp Pro Ser Lys Thr Ser Asn Thr Ile Arg Val Phe Leu Pro Asn
50 55 60

AAG CAA AGA ACA GTG GTC AAT GTG CGA AAT GGA ATG AGC TTG CAT GAC 240

Lys Gln Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp
65 70 75 80

TGC CTT ATG AAA GCA CTC AAG GTG AGG GGC CTG CAA CCA GAG TGC TGT 288

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gln Pro Glu Cys Cys
85 90 95

GCA GTG TTC AGA CTT CTC CAC GAA CAC AAA GGT AAA AAA GCA CGC TTA 336

Ala Val Phe Arg Leu Leu His Glu His Lys Gly Lys Lys Ala Arg Leu
100 105 110

GAT TGG AAT ACT GAT GCT GCG TCT TTG ATT GGA GAA GAA CTT CAA GTA 384

Asp Trp Asn Thr Asp Ala Ala Ser Leu Ile Gly Glu Glu Leu Gln Val
115 120 125

GAT TTC CTG GAT CAT GTT CCC CTC ACA ACA CAC AAC TTT GCT CGG AAG 432

Asp Phe Leu Asp His Val Pro Leu Thr Thr His Asn Phe Ala Arg Lys
130 135 140

ACG TTC CTG AAG CTT GCC TTC TGT GAC ATC TGT CAG AAA TTC CTG CTC 480

Thr Phe Leu Lys Leu Ala Phe Cys Asp Ile Cys Gln Lys Phe Leu Leu
145 150 155 160

AAT GGA TTT CGA TGT CAG ACT TGT GGC TAC AAA TTT CAT GAG CAC TGT 528

Asn Gly Phe Arg Cys Gln Thr Cys Gly Tyr Lys Phe His Glu His Cys
165 170 175

AGC ACC AAA GTA CCT ACT ATG TGT GTG GAC TGG AGT AAC ATC AGA CAA 576

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn Ile Arg Gln
180 185 190

CTC TTA TTG TTT CCA AAT TCC ACT ATT GGT GAT AGT GGA GTC CCA GCA 624

Leu Leu Leu Phe Pro Asn Ser Thr Ile Gly Asp Ser Gly Val Pro Ala
195 200 205

CTA CCT TCT TTG ACT ATG CGT CGT ATG CGA GAG TCT GTT TCC AGG ATG 672

Leu Pro Ser Leu Thr Met Arg Arg Met Arg Glu Ser Val Ser Arg Met
210 215 220

CCT GTT AGT TCT CAG CAC AGA TAT TCT ACA CCT CAC GCC TTC ACC TTT 720

Pro Val Ser Ser Gln His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe
225 230 235 240

AAC ACC TCC AGT CCC TCA TCT GAA GGT TCC CTC TCC CAG AGG CAG AGG 768

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gln Arg Gln Arg
245 250 255

TCG ACA TCC ACA CCT AAT GTC CAC ATG GTC AGC ACC ACG CTG CCT GTG 816

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val
260 265 270

GAC AGC AGG ATG ATT GAG GAT GCA ATT CGA AGT CAC AGC GAA TCA GCC 864

Asp Ser Arg Met Ile Glu Asp Ala Ile Arg Ser His Ser Glu Ser Ala
275 280 285

TCA CCT TCA GCC CTG TCC AGT AGC CCC AAC AAT CTG AGC CCA ACA GGC 912

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly
290 295 300

TGG TCA CAG CCG AAA ACC CCC GTG CCA GCA CAA AGA GAG CGG GCA CCA 960

Trp Ser Gln Pro Lys Thr Pro Val Pro Ala Gln Arg Glu Arg Ala Pro
305 310 315 320

GTA TCT GGG ACC CAG GAG AAA AAC AAA ATT AGG CCT CGT GGA CAG AGA 1008

Val Ser Gly Thr Gln Glu Lys Asn Lys Ile Arg Pro Arg Gly Gln Arg
325 330 335

GAT TCA AGC TAT TAT TGG GAA ATA GAA GCC AGT GAA GTG ATG CTG TCC 1056

Asp Ser Ser Tyr Tyr Trp Glu Ile Glu Ala Ser Glu Val Met Leu Ser
340 345 350

ACT CGG ATT GGG TCA GGC TCT TTT GGA ACT GTT TAT AAG GGT AAA TGG 1104

Thr Arg Ile Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp
355 360 365

CAC GGA GAT GTT GCA GTA AAG ATC CTA AAG GTT GTC GAC CCA ACC CCA 1152

His Gly Asp Val Ala Val Lys Ile Leu Lys Val Val Asp Pro Thr Pro
370 375 380

GAG CAA TTC CAG GCC TTC AGG AAT GAG GTG GCT GTT CTG CGC AAA ACA 1200

Glu Gln Phe Gln Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr
385 390 395 400

CGG CAT GTC AAC ATT CTG CTT TTC ATG GGG TAC ATG ACA AAG GAC AAC 1248

Arg His Val Asn Ile Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn
405 410 415

CTG GCA ATT GTG ACC CAG TGG TCC GAG GGC AGC AGC CTC TAC AAA CAC 1296

Leu Ala Ile Val Thr Gln Trp Cys Glu Gly Ser Ser Leu Tyr Lys His
420 425 430

CTG CAT GTC CAG GAG ACC AAG TTT CAG ATG TTC CAG CTA ATT GAC ATT 1344

Leu His Val Gln Glu Thr Lys Phe Gln Met Phe Gln Leu Ile Asp Ile
435 440 445

GCC CGG CAG ACG GCT CAG GGA ATG GAC TAT TTG CAT GCA AAG AAC ATC 1392

Ala Arg Gln Thr Ala Gln Gly Met Asp Tyr Leu His Ala Lys Asn Ile
450 455 460

ATC CAT AGA GAC ATG AAA TCC AAC AAT ATA TTT CTC CAT GAA GGC TTA 1440

Ile His Arg Asp Met Lys Ser Asn Asn Ile Phe Leu His Glu Gly Leu
465 470 475 480

ACA GTG AAA ATT GGA GAT TTT GGT TTG GCA ACA GTA AAG TCA CGC TGG 1488

Thr Val Lys Ile Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp
485 490 495

AGT GGT TCT CAG CAG GTT GAA CAA CCT ACT GGC TCT GTC CTC TGG ATG 1536

Ser Gly Ser Gln Gln Val Glu Gln Pro Thr Gly Ser Val Leu Trp Met
500 505 510

GCC CCA GAG GTG ATC CGA ATG CAG GAT AAC AAC CCA TTC AGT TTC CAG 1584

Ala Pro Glu Val Ile Arg Met Gln Asp Asn Asn Pro Phe Ser Phe Gln
515 520 525

TCG GAT GTC TAC TCC TAT GGC ATC GTA TTG TAT GAA CTG ATG ACG GGG 1632

Ser Asp Val Tyr Ser Tyr Gly Ile Val Leu Tyr Glu Leu Met Thr Gly
530 535 540

GAG CTT CCT TAT TCT CAC ATC AAC AAC CGA GAT CAG ATC ATC TTC ATG 1680

Glu Leu Pro Tyr Ser His Ile Asn Asn Arg Asp Gln Ile Ile Phe Met
545 550 555 560

GTG GGC CGA GGA TAT GCC TCC CCA GAT CTT AGT AAG CTA TAT AAG AAC 1728

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn
565 570 575

TGC CCC AAA GCA ATG AAG AGG CTG GTA GCT GAC TGT GTG AAG AAA GTA 1776

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val
580 585 590

AAG GAA GAG AGG CCT CTT TTT CCC CAG ATC CTG TCT TCC ATT GAG CTG 1824

Lys Glu Glu Arg Pro Leu Phe Pro Gln Ile Leu Ser Ser Ile Glu Leu
595 600 605

CTC CAA CAC TCT CTA CCG AAG ATC AAC CGG AGC GCT TCC GAG CCA TCC 1872

Leu Gln His Ser Leu Pro Lys Ile Asn Arg Ser Ala Ser Glu Pro Ser
610 615 620

TTG CAT CGG GCA GCC CAC ACT GAG GAT ATC AAT GCT TGC ACG CTG ACC 1920

Leu His Arg Ala Ala His Thr Glu Asp Ile Asn Ala Cys Thr Leu Thr
625 630 635 640

ACG TCC CCG AGG CTG CCT GTC TTC TAG 1947

Thr Ser Pro Arg Leu Pro Val Phe
645
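
The feature table for SEQ ID NO:11 gives the CDS as 1..1944 within a 1947-base sequence. The short check below is a sketch with illustrative variable names, not part of the original listing. It confirms that these figures are consistent with a 648-residue protein followed by a single stop codon, and that each printed row of 16 codons advances the running nucleotide count by 48.

```python
CDS_START, CDS_END = 1, 1944   # from the (ix) FEATURE entry
TOTAL_LENGTH = 1947            # from (A) LENGTH
CODONS_PER_ROW = 16            # layout of the printed listing

cds_length = CDS_END - CDS_START + 1
n_residues, remainder = divmod(cds_length, 3)
trailing_bases = TOTAL_LENGTH - CDS_END   # bases printed after the CDS

# Final printed row of SEQ ID NO:11, including the terminator.
LAST_ROW = "ACG TCC CCG AGG CTG CCT GTC TTC TAG".split()
STOP_CODONS = {"TAA", "TAG", "TGA"}


def row_end_count(row_number: int) -> int:
    """Running nucleotide count printed at the end of a full row (1-based)."""
    return row_number * CODONS_PER_ROW * 3


full_rows, last_row_residues = divmod(n_residues, CODONS_PER_ROW)

if __name__ == "__main__":
    print(n_residues, remainder, trailing_bases)
    print(full_rows, last_row_residues, row_end_count(full_rows))
```

The 40 full rows end at nucleotide 1920, and the final row carries the remaining eight codons plus the TAG terminator, which accounts for the last three bases up to 1947.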

(2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 

Met Glu His Ile Gln Gly Ala Trp Lys Thr Ile Ser Asn Gly Phe Gly
1 5 10 15

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys Ile Ser Pro Thr Ile
20 25 30

Val Gln Gln Phe Gly Tyr Gln Arg Arg Ala Ser Asp Asp Gly Lys Leu
35 40 45

Thr Asp Pro Ser Lys Thr Ser Asn Thr Ile Arg Val Phe Leu Pro Asn
50 55 60

Lys Gln Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp
65 70 75 80

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gln Pro Glu Cys Cys
85 90 95

Ala Val Phe Arg Leu Leu His Glu His Lys Gly Lys Lys Ala Arg Leu
100 105 110

Asp Trp Asn Thr Asp Ala Ala Ser Leu Ile Gly Glu Glu Leu Gln Val
115 120 125

Asp Phe Leu Asp His Val Pro Leu Thr Thr His Asn Phe Ala Arg Lys
130 135 140

Thr Phe Leu Lys Leu Ala Phe Cys Asp Ile Cys Gln Lys Phe Leu Leu
145 150 155 160

Asn Gly Phe Arg Cys Gln Thr Cys Gly Tyr Lys Phe His Glu His Cys
165 170 175

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn Ile Arg Gln
180 185 190

Leu Leu Leu Phe Pro Asn Ser Thr Ile Gly Asp Ser Gly Val Pro Ala
195 200 205

Leu Pro Ser Leu Thr Met Arg Arg Met Arg Glu Ser Val Ser Arg Met
210 215 220

Pro Val Ser Ser Gln His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe
225 230 235 240

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gln Arg Gln Arg
245 250 255

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val
260 265 270

Asp Ser Arg Met Ile Glu Asp Ala Ile Arg Ser His Ser Glu Ser Ala
275 280 285

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly
290 295 300

Trp Ser Gln Pro Lys Thr Pro Val Pro Ala Gln Arg Glu Arg Ala Pro
305 310 315 320

Val Ser Gly Thr Gln Glu Lys Asn Lys Ile Arg Pro Arg Gly Gln Arg
325 330 335

Asp Ser Ser Tyr Tyr Trp Glu Ile Glu Ala Ser Glu Val Met Leu Ser
340 345 350

Thr Arg Ile [...] Ser Gly Ser Phe Gly Thr Val Tyr Lys [...]
355 360 365

His Gly Asp [...]
370 375 380

Glu Gln Phe Gln Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr
385 390 395 400

Arg His Val Asn Ile Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn
405 410 415

Leu Ala Ile Val Thr Gln Trp Cys Glu Gly Ser Ser Leu Tyr Lys His
420 425 430

Leu His Val Gln Glu Thr Lys Phe Gln Met Phe Gln Leu Ile Asp Ile
435 440 445

Ala Arg Gln Thr Ala Gln Gly Met Asp Tyr Leu His Ala Lys Asn Ile
450 455 460

Ile His Arg Asp Met Lys Ser Asn Asn Ile Phe Leu His Glu Gly Leu
465 470 475 480

Thr Val Lys Ile Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp
485 490 495

Ser Gly Ser Gln Gln Val Glu Gln Pro Thr Gly Ser Val Leu Trp Met
500 505 510

Ala Pro Glu Val Ile Arg Met Gln Asp Asn Asn Pro Phe Ser Phe Gln
515 520 525

Ser Asp Val Tyr Ser Tyr Gly Ile Val Leu Tyr Glu Leu Met Thr Gly
530 535 540

Glu Leu Pro Tyr Ser His Ile Asn Asn Arg Asp Gln Ile Ile Phe Met
545 550 555 560

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn
565 570 575

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val
580 585 590

Lys Glu Glu Arg Pro Leu Phe Pro Gln Ile Leu Ser Ser Ile Glu Leu
595 600 605

Leu Gln His Ser Leu Pro Lys Ile Asn Arg Ser Ala Ser Glu Pro Ser
610 615 620

Leu His Arg Ala Ala His Thr Glu Asp Ile Asn Ala Cys Thr Leu Thr
625 630 635 640

Thr Ser Pro Arg Leu Pro Val Phe
645
WHAT IS CLAIMED IS:

1. A method of identifying an individual at an
increased risk for developing cancer, comprising:

amplifying a region of the c-raf-1 gene of said
individual;

analyzing products of said amplification for
evidence of mutation; and

classifying an individual having one or more
mutations in said region as having an increased risk for
developing cancer.

2. The method according to claim 1, wherein said
region encodes at least amino acids 514 to 535 of SEQ ID
NO:12.

3. The method according to claim 2, wherein said
region encodes at least amino acids 500 to 550 of SEQ ID
NO:12.

4. The method according to claim 3, wherein said
region encodes at least amino acids 450 to 630 of SEQ ID
NO:12.

5. The method according to claim 1, wherein said
products are analyzed by DNA sequencing.

6. The method according to claim 1, wherein said
amplification is effected using a polymerase chain
reaction (PCR).

7. The method according to claim 6, wherein said
PCR employs a primer comprising SEQ ID NO:7 and a primer
comprising SEQ ID NO:8.

8. A method for determining a prognosis in
patients afflicted with cancer, comprising:

amplifying a region of the c-raf-1 gene of said
individual;

analyzing products of said amplification for
evidence of mutation; and

classifying a patient having no mutation in said
region as being less likely to suffer disease relapse or
having an increased chance of survival than a patient
having one or more mutations in said region.

9. The method according to claim 8, wherein said
region encodes at least amino acids 514 to 535 of SEQ ID
NO:12.

10. The method according to claim 9, wherein said
region encodes at least amino acids 500 to 550 of SEQ ID
NO:12.

11. The method according to claim 10, wherein said
region encodes at least amino acids 450 to 630 of SEQ ID
NO:12.

12. The method according to claim 9, wherein said
products are analyzed by DNA sequencing.

13. The method according to claim 9, wherein said
amplification is effected using polymerase chain
reaction (PCR).

14. The method according to claim 13, wherein said
PCR employs a primer comprising SEQ ID NO:7 and a primer
comprising SEQ ID NO:8.

15. A method for determining the proper course of
treatment for a patient afflicted with cancer,
comprising:

amplifying a region of the c-raf-1 gene of said
patient;

analyzing products of said amplification for
evidence of mutation;

identifying a patient having at least one mutation
in said region, which patient may require treatment
proper for a patient having a lesser chance of survival
or decreased time to relapse; and

identifying a patient lacking mutations in said
region, which patient may require treatment proper for a
patient having a greater chance of survival or being
less likely to suffer disease relapse.

16. The method according to claim 15, wherein said
region encodes at least amino acids 514 to 535 of SEQ ID
NO:12.

17. The method according to claim 16, wherein said
region encodes at least amino acids 500 to 550 of SEQ ID
NO:12.

18. The method according to claim 17, wherein said
region encodes at least amino acids 450 to 630 of SEQ ID
NO:12.

19. The method according to claim 16, wherein said
products are analyzed by DNA sequencing.

20. The method according to claim 16, wherein said
amplification is effected using a polymerase chain
reaction (PCR).

21. The method according to claim 20, wherein said
PCR employs a primer comprising SEQ ID NO:7 and a primer
comprising SEQ ID NO:8.
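
The nested amino acid windows recited in the claims can be related to coding nucleotide positions, and the classifying step can be illustrated, with a short sketch. This is only an illustration under stated assumptions: the primer positions (coding nucleotides 1307 and 1915) are taken from Figure 6, residue numbering follows SEQ ID NO:12, and the function names and the sample sequence are hypothetical, introduced here for illustration and not taken from the disclosure.

```python
from typing import List, Tuple

# Coding nucleotides bounded by the two primers shown in Figure 6.
AMPLICON = (1307, 1915)

# Amino acid windows of SEQ ID NO:12 named in the dependent claims.
CLAIMED_WINDOWS = {
    "claims 2, 9, 16": (514, 535),
    "claims 3, 10, 17": (500, 550),
    "claims 4, 11, 18": (450, 630),
}


def codon_span(first_aa: int, last_aa: int) -> Tuple[int, int]:
    """Return the 1-based coding nucleotides encoding residues first_aa..last_aa."""
    return 3 * first_aa - 2, 3 * last_aa


def is_covered(span: Tuple[int, int], amplicon: Tuple[int, int] = AMPLICON) -> bool:
    """True if every nucleotide of span lies inside the amplified region."""
    return amplicon[0] <= span[0] and span[1] <= amplicon[1]


def find_substitutions(reference: str, sample: str,
                       first_aa: int) -> List[Tuple[int, str, str]]:
    """List (position, reference residue, sample residue) for each difference."""
    if len(reference) != len(sample):
        raise ValueError("sequences must cover the same residues")
    return [(first_aa + i, r, s)
            for i, (r, s) in enumerate(zip(reference, sample)) if r != s]


def classify(substitutions: List[Tuple[int, str, str]]) -> str:
    """Apply the classifying step of claim 1 to the analysis result."""
    return "increased risk" if substitutions else "no mutation found in region"


# Residues 514 to 535 of SEQ ID NO:12 in one-letter code.
REFERENCE_514_535 = "PEVIRMQDNNPFSFQSDVYSYG"

# HYPOTHETICAL sample invented purely for illustration (residue 533 altered);
# it is not a mutation reported in the disclosure.
HYPOTHETICAL_SAMPLE = "PEVIRMQDNNPFSFQSDVYRYG"

spans = {claims: codon_span(*window) for claims, window in CLAIMED_WINDOWS.items()}
found = find_substitutions(REFERENCE_514_535, HYPOTHETICAL_SAMPLE, 514)

for claims, span in spans.items():
    print(claims, span, is_covered(span))
print(found, classify(found))
```

Under these assumptions all three windows fall inside the region bounded by the Figure 6 primers, the narrowest one spanning coding nucleotides 1540 to 1605.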
Figure 3

Figure 6



1. Polymerase Chain Reaction (PCR) Amplification of Target DNA
(Either Genomic DNA or cDNA)

Primer 1 (coding nucleotides 1307 to 1326) = 5'- AGGAGACCAAGTTTCAGATG -3'

Primer 2 (coding nucleotides 1915 to 1896) = 5'- GCGTGCAAGCATTGATATCC -3'



PCR Cycles:

94 °C, 1 minute
55 °C, 1 minute
72 °C, 1 minute

Repeat 35 cycles
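
The primer coordinates and cycling program of step 1 can be cross-checked against the sequence listing. The sketch below is illustrative only: it uses the listing row for coding nucleotides 1873 to 1920, the helper names are hypothetical, and the melting temperature is estimated with the Wallace rule of thumb (2 °C per A/T, 4 °C per G/C), which is an assumption of this example rather than a method stated in the figure. The total time ignores temperature ramping and any additional denaturation or extension steps.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

PRIMER_1 = "AGGAGACCAAGTTTCAGATG"  # sense primer, coding nucleotides 1307-1326
PRIMER_2 = "GCGTGCAAGCATTGATATCC"  # antisense primer, coding nucleotides 1915-1896

# Coding nucleotides 1873 to 1920 as printed in the sequence listing.
ROW_START = 1873
ROW = "TTGCATCGGGCAGCCCACACTGAGGATATCAATGCTTGCACGCTGACC"

# One PCR cycle as (temperature in degrees C, minutes), repeated 35 times.
CYCLE = [(94, 1), (55, 1), (72, 1)]
N_CYCLES = 35


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return dna.translate(COMPLEMENT)[::-1]


def coding_slice(first_nt: int, last_nt: int) -> str:
    """Return coding nucleotides first_nt..last_nt (1-based, inclusive) from ROW."""
    return ROW[first_nt - ROW_START:last_nt - ROW_START + 1]


def wallace_tm(primer: str) -> int:
    """Rule-of-thumb melting temperature for a short oligonucleotide."""
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


primer_2_site = coding_slice(1896, 1915)
amplicon_length = 1915 - 1307 + 1
total_minutes = N_CYCLES * sum(minutes for _, minutes in CYCLE)

print(reverse_complement(primer_2_site) == PRIMER_2)
print(amplicon_length, total_minutes)
print(wallace_tm(PRIMER_1), wallace_tm(PRIMER_2))
```

Primer 2 is the reverse complement of coding nucleotides 1896 to 1915, so the two primers bound a 609 base pair product, and the 55 °C annealing step sits a few degrees below both rule-of-thumb melting temperatures.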



2. Isolate Product

3. Digestion with SphI and BglII

4. Clone into c-raf-1 Containing Vector

4. Clone into Alternate Vector

3. Direct Sequencing, Asymmetric PCR

5. Standard Di-Deoxy Sequencing
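
The digestion step can be illustrated by scanning the coding sequence for the two recognition sites. This is a minimal sketch under stated assumptions: the recognition sequences GCATGC (SphI) and AGATCT (BglII) are the standard ones for these enzymes, only the final rows of the coding sequence (nucleotides 1441 to 1947) are typed in below, and the function name find_sites is hypothetical. A site lying upstream of nucleotide 1441 would therefore not be reported by this example.

```python
RECOGNITION_SITES = {"SphI": "GCATGC", "BglII": "AGATCT"}

# Coding nucleotides 1441 to 1947 as printed in the sequence listing.
LISTING_START = 1441
LISTING = "".join("""
ACA GTG AAA ATT GGA GAT TTT GGT TTG GCA ACA GTA AAG TCA CGC TGG
AGT GGT TCT CAG CAG GTT GAA CAA CCT ACT GGC TCT GTC CTC TGG ATG
GCC CCA GAG GTG ATC CGA ATG CAG GAT AAC AAC CCA TTC AGT TTC CAG
TCG GAT GTC TAC TCC TAT GGC ATC GTA TTG TAT GAA CTG ATG ACG GGG
GAG CTT CCT TAT TCT CAC ATC AAC AAC CGA GAT CAG ATC ATC TTC ATG
GTG GGC CGA GGA TAT GCC TCC CCA GAT CTT AGT AAG CTA TAT AAG AAC
TGC CCC AAA GCA ATG AAG AGG CTG GTA GCT GAC TGT GTG AAG AAA GTA
AAG GAA GAG AGG CCT CTT TTT CCC CAG ATC CTG TCT TCC ATT GAG CTG
CTC CAA CAC TCT CTA CCG AAG ATC AAC CGG AGC GCT TCC GAG CCA TCC
TTG CAT CGG GCA GCC CAC ACT GAG GAT ATC AAT GCT TGC ACG CTG ACC
ACG TCC CCG AGG CTG CCT GTC TTC TAG
""".split())


def find_sites(dna: str, site: str, offset: int) -> list:
    """Return the coding coordinate of the first base of every occurrence of site."""
    positions = []
    index = dna.find(site)
    while index != -1:
        positions.append(index + offset)
        index = dna.find(site, index + 1)  # allow overlapping matches
    return positions


sites = {
    enzyme: find_sites(LISTING, sequence, LISTING_START)
    for enzyme, sequence in RECOGNITION_SITES.items()
}
print(sites)
```

Within the rows used here, a single BglII site begins at coding nucleotide 1704, inside the region bounded by the two primers, and no SphI site occurs downstream of nucleotide 1441.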